Unicode, ucs-2, ucs-4, UTF-16, utf-32, UTF-8
Unicode details
Copyright Notice: It can be reproduced at will, but the original author charlee and original link http://tech.idv2.com/2008/02/21/unicode-intro/must be indicated in a timely
With regard to the relationship between Unicode and UTF, it can be simple to remember: Unicode is a coding organization, a coding specification, in the Java middle Finger utf-16;utf is a Unicode encoded translation conversion format, so as to be
The original objective of Unicode is to use a 16-bit encoding to provide ing for over 65000 characters. However, this is not enough. It cannot cover all historical texts or solve the implantation head-ache problem, especially in network-based
Use directvobsub as the subtitle plug-in for the player.
Convert subtitles to UTF-8 no Bom format. The subtitles are garbled during playback.
Convert subtitles to UTF-8 BOM format. The subtitles are normal during playback.
It seems that
In PHP, you can use the mb_detect_encoding () () function to determine whether a string is a certain encoding, the specific use of methods and considerations are as follows:
Note that to use the Mb_detect_encoding function, you must turn on the
Using a UTF-8 file written in java can be read correctly, but if you use NotePad to save the same content in UTF-8 format, when a program is used, one invisible character is read from the file, and one more?
Instance:
Create a text: txt text with
You need to read the TXT file uploaded by the user in the project a few days ago, but you are not sure about the character set of the TXT file.
UTF-16, UTF-8 (with BOM), Unicode can be different based on the first three bytes
Public String
Generally speaking, Unicode encoding systems can be divided into two levels: encoding mode and implementation mode.
1.Encoding Method
Unicode is a character encoding scheme developed by international organizations to accommodate all texts and
This statement can be reproduced at will, but the original author charlee and original link http://tech.idv2.com/2008/02/21/unicode-intro/must be indicated during reprinting.
Basic knowledge
Differences between byte and
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.